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Only galaxies bright enough and large enough to be unambiguously identified and measured are 
included in galaxy surveys used to estimate cosmic shear. We demonstrate that because gravitational 
lensing can scatter galaxies across the brightness and size thresholds, cosmic shear experiments 
suffer from lensing bias. We calculate the effect on the shear power spectrum and show that - 
unless corrected for - it will lead analysts to cosmological parameters estimates that are biased at 
the 2 — 3 o" level in DETF Stage III experiments, such as the Dark Energy Survey. 

I. INTRODUCTION 

Weak gravitational lensing has emerged as a powerful tool to probe cosmological models. Current measurements [1- 
3] already constrain the amplitude of density perturbations in the universe and the total matter density. Future surveys 
arc projected to have the power to constrain the most important parameters describing both dark energy [4, 5] and 
dark matter [6-9]. 

While the largest uncertainties in these projections are experimental systcmatics, a lingering concern is our ability 
to make predictions for basic quantities such as the two-point function to sub-percent accuracy so that theoretical 
systematics will not be an issue. A number of higher order corrections to the two-point function have been considered: 
the Born correction, source-lens coupling, reduced shear, and lens-lens coupling [10-12]. Here we study another effect 
which contaminates the power spectrum at the same level as these [13]: lensing bias. 

Galaxies arc selected in weak lensing surveys only if they are bright enough and large enough for their shapes to 
be adequately measured. Lensing affects these criteria because galaxies too faint or small to make it into the catalog 
can be promoted into the sample if they are located in regions of large magnification. This effect is inevitable as it 
is only possible to cut on observed sizes and magnitudes, and cannot be eliminated by imposing brighter magnitude 
cuts. 

To appreciate the importance of this effect, consider a cartoon universe in which all galaxies are just a little too 
faint to be included in the survey. In this case, only galaxies behind regions of large magnification would be included, 
so one would be able to estimate shear only behind foreground matter overdensities. The ensuing shear map would be 
a map of clusters! Of course, reality is much more complicated than this toy example, and many galaxies will be in the 
survey by their own merits. Moreover, the sky-dilution from lensing will compete with the effect wc just described, so 
whether matter overdensities are over-sampled or under-sampled depends on the galaxy population. Nevertheless, it 
^ , is clear that the sampling of the cosmic shear field from a typical galaxy survey will almost always be biased. In this 
' paper, we derive this lensing bias and study its effect on the shear power spectrum. We also discuss how it affects 
rS ' other shear observables. 

In Section II, we present and discuss the leading lensing bias corrections. Section III then calculates the correction 
to the shear power spectrum, while other shear observables are discussed in Section IV. We conclude in Section V. 
The appendices contain a rigorous derivation of the leading and higher order correction terms, and discuss why the 
higher order terms can be neglected for the purposes of near-future surveys. 



II. LENSING BIAS AND COSMIC SHEAR 



This section describes the leading order lensing bias effects on cosmic shear. For a rigorous derivation and treatment 
of higher order terms, see Appendices A and C. Let us consider a survey of solid angle Ail with A'tot observed galaxies 
in total, so that the observed average number density is fi, = Ntot/ AVI. 

To first order, the observed galaxy overdcnsity Jobs is given in terms of the intrinsic galaxy overdensity Sg and the 
convergence n by [14]: 
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where (7 = 2/3/ + /3r ~ 2, and /?/ and (3r are the logarithmic slopes of the flux and size distributions. 
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In the following. 6obs will always stand for the background galaxies whose shear is measured, while foreground galaxy 
overdensities we correlate with will be denoted with S^^^ for clarity. 

In the weak lensing limit, cosmic shear can be described by a spin-2 field with two independent components, defined 
relative to fixed coordinate axes cc, y (-^ 71, 72), or with respect to the separation vector 9 7t, 7x )■ In the following, 
we let 7a and 7f, stand for either of these decompositions. We work in the flat sky approximation throughout, denoting 
positions on the sky with x. 

Let 7a(*) be the shear component a measured from galaxy i. The standard estimator for shear correlation functions 
^ab = ilalb) is given by (e.g., [15]): 

iaM = ^ ^ W\l,j) W{i)w{j)^a{i)lb{j), (3) 

where the sum runs over all pairs of galaxies and the normalization is given by: 

ij 

where w[i) is the weight assigned to galaxy i, and the window function picks out galaxies separated hy 9 — d9 < 
\xi —Xj \ < 9 + d9. For the remainder of the paper, we will set all weights w{i) = 1, assuming that they are determined 
by measurement errors and intrinsic ellipticities [16], and are therefore uncorrelated with the cosmological signal. We 
also assume that the shape noise is uncorrelated with the density field. 

In Appendix B, we consider pixel-based estimators. As shown there, pixel-based estimators for the shear correlation 
function are subject to a very similar bias to the galaxy pair-based estimator above, provided that each pixel is weighted 
by inverse variance. 

We wish to take the expectation value of equation (3). To do so, we partition the survey volume into infinitesimal 
cells of equal solid angle dil so that the number of galaxies nohs{i)dil in cell i is either or 1 for all cells. Given this 
partition, we can express equation (3) as: 



where the sum is now over all cells. The normalization N can be similarly rewritten. Now, we have that nobs(«) = 
n (1 -|- 5obs(*)) where (5obs(*) is the fluctuation in the galaxy density field in cell i. Inserting these expressions into the 
above equation and taking the expectation value in the continuum limit, we find (see Appendix A for details): 

{U{0)) = + ^obs(l)]7a(l) [1 + 5obs(2)]7b(2)^ , (6) 

where we have denoted two positions separated by 9 on the sky with '1' and '2'. The quantity Af is defined in 
Appendix A and comes from the normalization by the observed number of galaxy pairs. 

The important point to note here is the fact that the non-uniform sampling of source galaxies through 1 + Sohs 
makes the estimator in equation (3) sensitive not only to the shear but also to the source galaxy overdensity and 
the lensing magnification. Operationally (with the caveat of higher-order corrections), the estimator in equation (3) 
replaces the true shear 7(1) by an "observed" shear: 

7a'" -> 7a(l + Sobs) ^la{l+Sg+ Qk) . (7) 

Hence, by expansion of equation (6) we obtain the leading corrections: 



U{0)) = {la{l)lb{2)) 

+ ( Ml) + q K{l)]^a{l)lb{2)) + (7a(l) M2) + q «(2)]7b(2)) (8) 

The correction terms are of two kinds: one involves correlations of (5g7a, i.e. intrinsic overdensities of background 
galaxies with shear by mass fluctuations in the foreground. For a sufficiently narrow redshift distribution of source 
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galaxies, this source-lens clustering is negligible, since the distribution of sources and lenses do not overlap in this 
case. In case of the lensing skewness and kurtosis, it was shown that the effect is small if the width of the source 
redshift distribution is less than 0.15 [17]. Hence, if photometric redshifts are available, source-lens clustering can be 
avoided. We will not further consider source-lens clustering in the main part of the paper. 

The second type of correction in equation (8) is due to magnification and size bias and is of the form q ■ K"/a. 
These corrections can be significant, since they correlate the shear field with the same foreground lensing field. It is 
worth noting that the leading lensing bias corrections are of exactly the same form as the reduced shear correction 
[12, 18, 19]: there, 7 7(1 -I- k) perturbatively, whereas here, we have 7 7(1 -I- qk). Hence, reduced shear 
corrections and lensing bias corrections should be considered jointly. The main difference is that the size of lensing 
bias corrections depends on the background galaxy sample via the parameter q. From now on, we consider both 
effects simultaneously, so that: 

7r-7a[l + (l + 9)«]. (9) 

Note that the normalization Af in equation (6) is relevant in canceling some higher-order terms. In particular, one 
might wonder whether the term: 

(<5,(l)<5,(2))(7a(l)76(2)), (10) 

which appears in the expansion of equation (6) might be a significant contribution. This term is however canceled 
through A/", since the shear estimator is normalized to the number of observed galaxy pairs used in the measurement 
(see also Appendix C). The contributing higher-order terms due to lensing bias which we neglected in equation (8) 
involve the shear 4-point functions. We discuss these terms in Appendix C and find that they are suppressed by 
roughly two orders of magnitude with respect to the cubic terms, i.e. they entail corrections at the level of C(10~^) of 
the shear power spectrum. Corrections of this magnitude are not expected to be of interest in the foreseeable future, 
hence we neglect them for the remainder of the paper. 



III. IMPACT ON THE POWER SPECTRUM 



In this section, we present the results of a calculation of the leading magnification effects on the shear auto- 
correlation, equation (8). Specifically, we will consider = (77*) = (7171) + (7272) for background galaxies at a 
fixed redshift of Zs = 1- The cubic corrections involve three-point functions of shear and convergence It is much more 
convenient to calculate these in Fourier space where, in the absence of B modes, the complex shear is related to the 
convergence as: 

7(f) = e^'"^* k(£). (11) 

Here, 4>i is the angle of the £ vector with the x-axis of the coordinate system. Then, the shear power coefficients C"'(£) 
are defined as: 

= («^(^>(-^^')) = i^irrsDie- i)c'^{e), (12) 

and their relation to the real-space correlation function is given by: 

cii) = J d^e^^{e)e-'^-^. (13) 

The calculation then proceeds exactly as in the case of the reduced shear correction [12, 18, 19]. In Fourier space, the 
multiplication in equation (9) turns into a convolution: 

{^im = J ^liiiHe-h) = J ^e^^'^'^K{i,Mi-i,), (14) 

where we have used equation (11). Then, the leading correction to the two-point correlator equation (12) is given by: 
S{l{e)r{i')) = 2(1 + q) J ^e'20'ie-2^" , (15) 
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FIG. 1: Relative size of the combined lensing bias and reduced shear correction AC"°(^) on the shear power, for different values 
of the flux/size count slope q. The curve for g = only shows the reduced shear correction. A single source redshift Za = 1 was 
assumed. 



where the factor of 2 comes from the two permutations. Using the definition of the convergence bispectrum, 

(^K{h)K{£2)^{e3)'j = {27T)H{h + £2 + £3)B-{e,,e2j3), (i6) 

we obtain the correction to the shear power spectrum: 

AC^i£) = - C''i£) = 2(1 + q)J ^e2^(*"i-*'')i?«(/i,/- -£). (17) 

The imaginary part of equation (17) vanishes, signahng that no B modes are produced by these 3-point terms (see 
Appendix D for a treatment of the B- modes induced by the 4-point terms). The remaining real part is: 

AC''(^) = 2(l + q) j ^cos24>,,B-{h,i -£,,-£) (18) 

Here, we have set 0£ = without loss of gcnerahty. The prcfactor 1 + g in equation (18) sums up the reduced shear 
and lensing bias corrections. 

To estimate AC"'(Z), we adopt a flat ACDM cosmology with parameters given hy h ~ 0.7, fim = 1 ^ ^^a = 0.28, 
Us ~ 0.96, CT8 = 0.85. We use equations (C17)-(C18) in Appendix C relating the shear power and bispectrum to 
the matter power spectrum and bispectrum. Further, we use the non-linear matter power spectrum according to [21] 
together with the bispectrum fitting formula from [22] . Our calculation of the reduced shear correction agrees with 
that of [12, 19] where it was shown to match the results of ray-tracing through N-body simulations. 

Figure 1 shows the relative magnitude of the cubic correction /S.C'^{£)/C'^{£) from equation (18) for a range of q 
values from to 2. for a fixed source redshift of Zg = 1. In Schmidt et al. [14], we consider a galaxy sample similar 
to the one expected for the Dark Energy Survey [DES, 23]. Measuring the slope of the galaxy size and magnitude 
distributions according to equation (2) for a range of magnitude and size cut values, we obtain g « 1 — 2 (see [14] for 
details). 

At £ ^ 1000, the cubic correction term reaches about 4% for q = 2, which is larger than what one might naively 
expect from perturbation theory. This is because of a much larger weighting of low-redshift contributions in the weak 
lensing bispectrum when compared to the power spectrum (Appendix C): the correction equation (18) is enhanced 
by the more strongly non- linear matter distribution at low z. 
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FIG. 2: Same as figure 1, except for different source redsfiift distributions: tfie lines for z = 0.5, z = 1.0 and z = 1.5 show 
tlie result for single source redshifts, while the blue dashed line shows the result for a broad redshift distribution z = Q — 1.4 
expected for the full DES galaxy sample [20]. 

We show the effect of varying source redshifts in figure 2. The relative correction to C'^{£) increases with redshift, 
although the z-dependence is quite weak. We also consider a very broad galaxy redshift distribution dN/ dz expected 
for the full DES galaxy sample [20], spanning redshifts from to 1.4. Even in this case, the magnitude of the effect 
is not affected significantly. 

IV. LENSING BIAS CORRECTIONS TO OTHER SHEAR OBSERVABLES 

While we chose the cosmic shear power spectrum as a representative example to illustrate the magnification and 
size bias effects, it is worth considering briefly the effects on other observables. In the following, we assume values of 
q = 1 — 2 as typical. 

A. Mean Shear 

Lensing bias (and reduced shear) do not affect the mean of the shear, or convergence if the convergence is estimated 
from the shear. That is, one might worry that, since regions of large k are preferentially selected, the average value of 
the shear in all pixels in a survey might be pushed to a non-zero value. This is not the case. Consider the estimator 
for the convergence: 

= J e^'"""/ d^^'e-^'-'' [cos(20,)7°'^^(f') + sm{2cl,,hf%x')] (19) 

where is the angle between £ and the a;-axis, and '^a^^{x) are the measured ellipticities. We have argued that 
inevitably 

7f^(f)-7a(a?)[l + <5obs] (20) 

so contains quadratic terms such as ja{x)K{x). Symmetry though dictates that the means of these terms, ("faix')K{x'))^ 
in equation (19) vanish: 71 is just as likely to be positive as negative so 71 /t averages to zero. The mean therefore of 
any linear combination of the shear components remains zero in the presence of the corrections considered here. 
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B. B-Modes 



Cooray and Hu [24] pointed out that corrections to the Born approximation inevitably lead to non-zero B-modes. 
Lensing bias (and reduced shear) also produce B-modes; the Gaussian contribution to the spectrum is (see Appendix 
D): 



C7^(^) = (l + <7)2 



■ sin(20r)C"^(£')C"'(K"- ^^1) sin(20r) + sin(2(/)-,^-) 



(21) 



Apart from geometric factors and the prefactor, this is of order PC'^{1) ~ 10^^ smaller than the E-mode spectrum, 
in qualitative agreement with the terms analyzed in [24]. While the factor of (1 -I- g)^ could provide a boost of 
order 10 to this B-mode power spectrum, the amplitude is still likely too small to be detected in upcoming surveys. 
Therefore, B-modes will continue to serve as excellent checks of systematic effects. Note that on very small scales, 
the non-Gaussian contribution from the trispectrum might be significant. A calculation of this contribution is much 
more involved and is left for future work. 



C. Galaxy-shear correlation 



Following an argument analogous to the one presented for the shear-shear correlation function, we can derive the 
impact of lensing bias on the galaxy-shear correlation function. The equivalent of equation (6) for the correlation of 
a background shear 7^ with foreground galaxies is: 

(4a W) = (;^'5l(l) [1 + <5obs(2)]7a(2)^ , (22) 

where Af' again is from the normalizing denominator in the estimator defined in Appendix A. Expanding equation (22), 
we obtain the following corrections to the galaxy-shear correlation: 

(4a(e)) = (<5fe(l)7a(2))+gf^(«:f^(l)7a(2)) 
+ ([<5fe(l) + g's^fe(ip^(2)^^(2)) 

+ g( [5^*^(1) + gfsA^fg(l)]K(2)7a(2)) (23) 

Here, q^^ and k^^ denote q and convergence for the foreground galaxies. The first line of equation (23) shows the 
lowest order contributions, including the lensing bias of foreground galaxies [25, 26], while the second line shows 
the corrections due to source-lens clustering which we again assume to be small. Finally, the third line shows the 
contributions due to lensing effects on the background shear. These are again similar to the corresponding reduced 
shear correction, apart from the factor of q. In total, we expect the lensing bias effects on the galaxy-shear correlation 
to be of similar size as those on the shear autocorrelation (Section III), and to scale similarly with redshift. 



D. Shear tomography 

Measuring shear correlations between different redshift slices allows for precise constraints on the expansion history 
of the Universe and the dark energy equation of state, w. Using the Fisher matrix technique, Shapiro [19] estimated 
the dark energy parameter biases incurred when neglecting the reduced shear correction. Since the change to the 
shear-shear power spectrum due to reduced shear and lensing bias scales as 1 -\- q, we expect the corresponding 
parameter biases to increase by a factor of 2 — 3 for q — 1 — 2 if these effects are neglected. For example, for a 
DES-like survey (DETF Stage-III) , we expect a biasing of w at the 2 — 3a level for a flat wCDM model. 



E. Shear variance and aperture mass 



Apart from the correlation function, shear auto-correlations are often measured in terms of top-hat variance 
and aperture mass Map(0) (e.g., [15, 27]). These estimators use window functions which weight angular scales in 
different ways. White [28] showed that the reduced shear correction has a ^ 12% effect on the top-hat shear variance 
even on large scales. This is because small scales contribute strongly to the shear variance. Following the results of 
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the previous section, we expect this effect to be amphfied by (1 + 9) to a total of ~ 24 — 36% when including both 
reduced shear and lensing bias. In case of the aperture mass, angular scales much smaller than the filter scale are 
downweighted, so that the effect on the aperture mass variance is smaller; we expect a 10 — 25% correction for 
< 4 arcmin. 



Estimates of cosmic shear suffer from lensing bias: the way one selects galaxies to estimate shear is correlated 
with the shear field itself. This correlation reflects the fact that cosmic shear and magnification are due to the same 
foreground matter along the line of sight, and magnification and size bias can scatter source galaxies into or out of 
the galaxy catalog. 

Lensing bias needs to be understood if lensing is to be used as a precision probe of the dark sector. We estimated 
that neglecting lensing bias and its cousin reduced shear when interpreting the results of a DETF Stage III cosmic 
shear experiment such as DES will lead to estimates of the dark energy equation of state which differ from the true 
value by 2-3 statistical standard deviations. Thus, this is an important systematic that needs to be addressed. In 
fact, lensing bias is likely to pollute other lensing measures even more severely: piggy-backing on the calculation of 
[28] for reduced shear alone, we estimate that lensing bias -I- reduced shear will affect shear variance and aperture 
mass variance at the 20 — 30% level. This could well be of importance to weak-lensing selected clusters, since cluster 
finding and mass estimates from weak lensing are based on estimators similar to aperture mass or top hat variance 
[29, 30]. Lensing bias is also likely to be the most significant source of cosmological B-modes in the shear field, at 
roughly ^ 10^'^ of the E-modes on small scales for the Gaussian contribution. 

Correcting for these biases should not be too difficult: the perturbative calculation presented here has been shown 
to agree with simulations [12]. While better calibration is needed, this is clearly a solvable problem, especially on 
large scales where baryons are not a factor. Moreover, one can imagine calibrating from the data itself by varying size 
and magnitude cuts to isolate q. Indeed, one possible application of lensing bias is as a calibrator for multiplicative 
and additive shear errors. We plan to explore this possibility in future work. 
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This section presents derivations of the exact expressions for the lensing bias and source clustering contributions to 
shear correlations. To keep the expressions as general as possible, it is useful to define a window function W^{x,x'), 
where x, x' stand for positions on the sky, which is normalized so that: 
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APPENDIX A: RIGOROUS DERIVATION OF SHEAR CORRECTIONS 




(Al) 



In the case of correlation functions, W picks out galaxies separated by 9±d6. More generally, other shear observables 
such as top-hat variance or aperture mass can also be written in this way. Again, we write the un-pixelized estimators 



8 



for the shear-shear correlation, £,ab{S) [equation (5)], and the galaxy-shear correlation, £,ga{S) (see Section IV C) as: 

N = J2 W'ii^j) nobs{^)dnnobsU)dn, (A3) 



UO) = 5^M^^(^,,)!^d^7 7.(^)^^^^rf^^, (A4) 

ij 

N'{t) = ^iy«(z, j)nobs(j)df^. (A5) 

j 

We now take the infinitesimal patches of equations (A2)-(A5) to the continuum limit, so that nobs(*) becomes 
n[l + dohs{x)], and write the estimators for galaxy-shear and shear-shear correlations as integrals: 

, ... ^ / d^x J <Px'[l + 5,^M]la{^) [1 + 5,U^')]lb{x') W%x, x') 

^"''^ ' Jd^x''Jd^x'''[l + SoUx'')][^ + 6obs{x''')]W'>ix'\x''') ^ ' 

^^•^ /"^2 /efg /-JN [1 + <5obs(^')]7a(^') wSz-j-j/N 



The exact expressions for the expectation values of these estimators are given by: 

^ (n^\ - f^^ '/ [1 + SoU£)ha{x;) [1 + s,U£')]jt{x') \ , 

^'"'^'>/ J AnJ \jd^x''/Anjd^x'''[i + s,us'')][i + s,us'n]w%x'',x'n/ ' ^ ' 



Ail J \J d^x"[l + S^hs{x")]WO{x, x") ^ 

Now we neglect the integrals outside of the correlators, which essentially smooth the correlation functions over the 
separations defined by the angular bin width. Using some additional notation, we can then write the expectation 
values of the quadratic estimators as follows: 

\ 1 + 2(5obs + (^obs^obs / 



l + (5obs(l) 



where we have defined: 



SobM = j d^x' 5,US')W\x,x') (A12) 
r (px 

Sobs = / ^^obs(^) (A13) 

S^Zs^s = j d?x'5obs{x)5obs{x')W\x,x') (A14) 

The first quantity is the observed overdensity averaged over an annulus around the given location. Hence, it is 
evaluated at position '1', giving the overdensity of observed galaxies in an annulus around that position. For sufficiently 
large separations 9, Jobs will be small, while it will be of order unity for separations close to the galaxy correlation 
length. Jobs is the overdensity of the galaxy sample (including magnification) averaged over the whole survey, measured 
relative to the ensemble average or an infinite- volume survey. We have not neglected this averaged overdensity for the 
sake of completeness, although for actual wide-field surveys this will be negligible. Finally, Jobs^obs is a product of 
overdensities smoothed over separations around 6. Note that this quantity is within the expectation value and hence 
cannot immediately be replaced by £,gg{0). 
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We now expand the expectation values of the correlation functions up to fourth order in 5 and k. For sake of 
brevity, we keep 5obs [equation (1)] unexpanded: 



(7a(l)76(2)) 

+ ([<5ob.(l) 7a(l)7;>(2)) + (7a(l) [<5obs(2) 76(2)) 

7a(l)7fc(2) 



'5obs(l)^obs(2) — f^obsfJobi 

-2 (Jobs [Sohs (1) + (Sobs(2)] 7a(l)7b(2)) + 2 (<5ob/7a(l)7b(2) 
%am) = (-51(1)7^(2)) 

;'5i(i)['5obs(2)-5:rs(i)]7a(2)) 
I'^lii) [^/(i) - j;rs(i)<5obs(2)] 7a(2) 

To be explicit, the term in the third line for ^ab stands for: 



(A15) 



(A16) 



7a(xi)76(x2)^ (A17) 

Note that the form of all magnification/size bias corrections in the final expressions correspond to a replacement 
of the form: 



Jobs(^l) '5obs(^2) - / d^Xs / <fx4 W^{X3,X4) (5obs(^^3) (5obs(^4) 



7(1) ^ 7(1) {1 + (5obs(l) - 5[(5obs](l)} , etc., 



(A18) 



where ^[(Sobs] is some smoothing of (5obs over annuli or the whole survey. Hence, all corrections vanish if either, 
<5obs = 'S'[(5obs], i-c. there is no observed clustering of background galaxies (including magnification effects), or if the 
shear field 7 is uncorrelated with (5obs- This holds analogously for a pixelized estimator (see Appendix B) and is in line 
with the intuititive understanding of the magnification corrections. Keeping only the cubic terms of equation (A15), 
and neglecting (Jobs, we arrive at equation (8). See Appendix C for a discussion of the four-point terms. 



APPENDIX B: PIXEL-BASED SHEAR ESTIMATORS 



An alternative approach to estimating shear correlations is to divide the survey volume into pixels a of finite volume 
defined by window functions VVaix) (normalized so that JyVa{x)(Px = 1). Each pixel contains many galaxies, and 
one estimates the shear and galaxy overdensity directly for each pixel (e.g., [15, 31]; again setting all weights to 1): 

7a(a) = ^^^nobs(*)7a(»)W„(z) (Bl) 

"pix i 

Here, 7a(«) is the shear measured from galaxy i, n^^^ is the foreground galaxy density, and we have again subdivided 
the finite-sized pixel into infinitesimal patches i, so that the observed number of galaxies nobs(*) in each patch is either 
or 1. ?ipix(a) is the number of galaxies observed in pixel a, while fipix is the expected average number of galaxies 
per pixel: 

npix(a) = ^nohs{i)yVa{i), ripix = nWg {i) = n, (B3) 

i i 

and analogously for the foreground galaxy densities. The estimators equation (Bl) and (B2) result in pixelized maps 
of the shear components and foreground galaxy overdensities, which can then be processed in real or Fourier space to 
measure shear correlations. 

Going to the continuum limit of equation (Bl) and (B2) yields: 

- ( -.^ /d^x[l+(5obs(x)]7a(x)>Vc.(x) 

^•^^"^ Jd^x'[l + 6,US')]Wg{x') ^ ' 

5{a) = I d2^!i^ii|_^>V„(x), ^ I ^nobs(^) (B5) 
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We now take expectation values of correlators of the pixelized shear and overdensity fields, neglecting the effect of 
pixelization on the correlation function (which is appropriate if the separation 6 is much larger than the pixel scale): 



{l + 6obsha{l) (l + .5ob.)7b(2) 

i + 5;;;;r(i) i + w2) , 



5oL(l) 



(l + '^obs)7a(2) 



(B6) 



(B7) 



l + '5obs(2) , 

where we have labeled two points separated by as 1 and 2. Here, barred quantities denote averages over pixels: 

X{a) = J (fx' X{x')yVaix'). (B8) 

In equation (B7) , we have neglected a correction due to the integral constraint [32] . since n is measured in the survey 
itself. However, this effect is of order the overdensity averaged over the whole survey, and hence very small for large 
surveys. In contrast, the denominators kept in equations (B6)-(B7) are integral constraints which are important, 
since they are of order of the overdensity averaged over pixel scales. 

Expanding the expectation values to fourth order, we obtain for the pixelized estimators: 

iabiO)) = (7a(l)7fc(2)) 

+ (7a(l) P^(2) - W2)7b(2)] ) + ( [S^ail] - Wl)7a(l)] 76(2)) 

+([^;;;^(i) -Qi)7a(i)] [s:^bi2) ~6~^s{2)%i2)]) 

CsaiO)} = (^1(1)7J2) 



Sobs (2)76(2) - SoU2}6obslb{2) 



(B9) 



+ (-^1(1) [<^ob.7a(2)-<5obs(2)7.(2)] 



,5ob. (2)7,(2) -,5ob.(2)<5obs7h (2) 



(BIO) 
(Bll) 



Clearly, all corrections vanish if Sohs 7 = <5obs 7 within a correlator, which is the case if either the observed galaxy 
distribution (Jobs or the shear 7 are smooth on pixel scales, or if they are completely uncorrelated. 

For the same reasons as for the unpixelized estimator, detailed in Appendix C, the quartic corrections are much 
smaller than the cubic corrections. Repeating the derivation leading to equation (18), and noting that in Fourier 
space the smoothed convergence field is given by: 



k{£) = yv(£) k(^), W(£) = / (fx Wa{x)e 



(B12) 

we obtain the following expression for the magnification correction to the shear power C^it) in case of the pixelized 
estimator: 



ACf = 2q 



2q 



(fh 
i2^Y 

(2^)2 



cos20£, w{i)-w{ii)w{-t-ii) B{e,ei,-e~ ii) 



cos20£, l-|W(£i)|^ B{eji,~e - ii). 



(B13) 
(B14) 



For the second approximate equality, we have assumed that £ <^ 1/^pix, where 0pix is the angular size of the pixels, so 
that W(^) ~ 1. The factor in square brackets in equation (B14) is the only difference in the magnification correction 
for the pixelized estimator compared to equation (18). This factor acts as a high pass, so that only modes with 
£ > l/0pix contribute to the magnification corrections, as expected from equation (B9) and our discussion above. 
Hence, for sufficiently small pixels, the magnification corrections are suppressed. 

Note, however, that our derivations assumes that in estimating shear correlations from pixelized maps, all pixels 
receive the same weight. If a weighting scheme according to signal-to-noise is used, weighting each pixel by the number 
of observed galaxies within the pixel (as appropriate for inverse variance weighting), the magnification correction is 
re-introduced, and we essentially go back to equation (18). Note that in any case the reduced shear correction is not 
suppressed by using small pixels, and is always given by equation (18). 
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APPENDIX C: QUARTIC CORRECTIONS 

Going from equation (A15) to equation (8), we have neglected several 4-point correlations. For the estimators we 
consider, all of the 4-point terms due to lensing bias are suppressed by roughly two orders of magnitude compared to 
the cubic terms, as we dicuss below. Similar conclusions hold for the 4-point terms in the pixelizcd estimator. In the 
following, we again neglect the galaxy overdcnsity averaged over the whole survey, <5obs- 

The 4-point contributions can be divided into three classes. First, there are source-lens clustering terms: 



a(u(0)) = {[5gil)S,{2) - sjMlHi2)) (CI) 

\ / quartic,! \ / connected 

+ (<5,(l)7a(l)) {6gi2hbi2)) - (<537r(l)y(<5g76(2)) (C2) 

+ {S,{1H{2)) (5,(2)7a(l)) - {S,^JI)){SM^)) (C3) 

+ {5,il)6g{2)) (7a(l)76(2)) - (7a(l)76(2)> • (C4) 



The first, connected term is related to the matter four-point fmiction. The second and third lines contain the actual 
source-lens terms. Note that for the second terms in each line, the product of the two correlators is to be integrated over 

following equation (A14). The two terms in the fourth line cancel, since (^SgSg'^ = {6g{l)Sg{2)) (see equation (A14)). 

This cancelation is a consequence of the normalizing denominator in equation (A2): when measuring the shear, we 
divide by the number of pairs of observed galaxies with given separation 9. 

The second set of quartic contributions arc mixed source-lens clustering and lensing bias terms: 



(Lb{0)) = g([5,(lM2)- Vha(l)76(2)) ^ (C5) 

\ / quartic, 11 \ / connected 

+ q{{6g{lha{l)) {<2hb{2)) - {Sg^){Kjt{2))} (C6) 

+ q {{6g{lht{2)) {n{2)Ul)) - (5,^)(K7fa(2))} (C7) 

+ q{{Sg{lU2)) (7a(l)7b(2)) - (V^) (7a(l)76(2))} (C8) 

+ {Sg{lM2) ^ Kmjm- (C9) 



As expected, these are all proportional to q. The first three lines again give the contributing source-lens cluster- 
ing/lensing bias contributions, while the terms in the fourth line cancel. 

Finally, the quartic terms from "pure" lensing bias receive two contributions: first, from the quartic terms in equa- 
tion (A15). Second, there are quadratic contributions to (Jobs from lensing bias. Expanding the lensing magnification 
A = [{1 — k)^ — l7p]~^^^ to second order, we obtain [33]^: 

Sohs = Sg+qK + ClK^ +€21^1"^, (CIO) 

where ci = q{q + l)/2, C2 = q/2, and I7P = 7i + 72- Together, we obtain the following quartic terms due to lensing 
bias: 

^(UiO)) . - 'Z'(K1)^(2)-S^]7a(l)76(2))e0„ncctcd (dl) 

\ / quartic, iii 

+ 2q'[{K{lhail)) (^^(2)7fc(2)) - (^^'jj)){nib{2))} (C12) 

+ g{(«;(l)«;(2))(7a(l)7b(2))-(^)(7a(l)7b(2))} (C13) 

+ ci(«:2(l)7.(l)7.(2))„,, + C2(|7(l)p7a(l)7.(2))„,, + {(l) - (2)} (C14) 

+ {c, {^'{1)) + C2 (17(1)0} (7a(l)7.(2)) + {(1) - (2)} (C15) 

+ 2ci(K(l)7,(l))(K(l)7fc(2)) + 2c2 ^ (7,(1)7,(1)) (7e(l)7b(2)) + {(1)^(2)}. (C16) 

c=l,2 



^ We neglect second derivatives of riots with respect to In/, Inr here. 
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Bispectrum: WJ/;^^ 
Power spectrum: Wg/;^ 
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FIG. 3: ie/t panel: Lensing weight functions for the shear power spectrum, Wl(Xs,x)'^/x (red, dashed), and the shear 
bispectrum, Wl(x3i x)'^/x'^ (blue, solid), in units of Xs. Right panel: Scaled shear power (!'^/(2tv) C"'{t) (blue, dashed), and 
equilateral bispectrum power £^/(27r) B'^{1, 1, 1) (red, solid), for Zs — 1. The thin lines show the linear/tree-level prediction, 
while the thick lines are using the non-linear fitting formulas of [21] and [22]. 



Here, {(1) <-> (2)} means that k^(1), |7(1)P are to be replaced with k^(2), |7(2)|^, respectively. Line (Cll) and (C14) 
are connected terms given by the shear four-point function. We will discuss those below. The terms in line (C13) 
cancel in the same way as the corresponding source-lens terms. The terms in lines (C12), (C15), (C16) are proportional 
to Ckk(^)^, or £,kk{0) Ckk(O). In other words, the relative magnitude of these corrections is of order ^kk(O) '^few lO"'^ 
or less. Hence, we can safely neglect them compared with the percent-level of the cubic corrections. 

In order to understand why the cubic corrections are so much more important, consider the expressions for the shear 
power spectrum (2-point function) and bispectrum (3-point function). Using the Limber or small-angle approximation, 
the shear power for sources at a fixed redshift Zg, with Xs — xi^s), can be written as a projection of the matter power 
spectrum P{k, x)' 

3o rr2V dxWLiXs,X? 



Here, x denotes comoving distance, Wl{xs,x) — x/XsiXs — x)i and a is the scale factor. Similarly, the shear 
bispectrum B'^ can be written as a projection of the matter bispectrum B{ki, k2, k^; x): 

B'(eul.M = r|n,„H»)' r'^fEii^yJXkk,) , (CIS, 

V2 J Ja X \ Xaix) J \x X X J 

In case the sources are distributed according to a broad redshift distribution, dN/dz (assumed normalized to unity), 
Wl in equations (C17)-(C18) is to be replaced with: 

1 /■°° dN 
WL,dN/d.{x) = TTT^ dz,WL{xizs),x)-r(^s). (C19) 

^(X)Jz{x) 

Now, for the 3D matter field, {S{1)S{2)S{3)) is of the same order of magnitude as (5(1)5(2)) ((5(2)5(3)) -|-cycl.. However, 
in case of the shear, which is proportional to the projected density field, B'^{£,£,£) is larger than C"*(£)^ by a factor of 
order several hundreds (right panel of figure 3). This is because the two-point and three-point functions are projected 
with a different weighting of low- 2; contributions. Figure 3 (left panel) shows the effective weight functions for 
(red, dashed) and B^ (blue, solid). Clearly, the late-time contributions receive more weight in case of the bispectrum, 
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which grows as ^ D(a)^. In addition, the low- 2; contributions along the line of sight are probed at smaller scales, 
which additionally enhances the bispectrum. For this reason, the cubic corrections dominate the disconnected quartic 
contributions in equations (C11)-(C16), even though they are formally of the same order in perturbation theory. 

The connected four-point terms, equation (Cll) and (C14), are given by the convergence trispectrum. We can 
roughly estimate the contribution from these terms relative to the cubic terms as: 



"iTT ..^ .2 - 0.05 for e < lO''. (C20) 



Here, T^^ denotes the square trispectrum, B^^^ denotes the equilateral bispectrum, and the scaled quantities A^^j, A^^ 
were defined and calculated in [34, 35]: 



A,\lW^^[i3^q,(^)]l/^ A,\(£)^^[T«W]V3. (C21) 



Note that for very small scales, or in case equation (C20) underestimates the size of the connected (non- Gaussian) 
quartic contributions, the quartic contribution in equation (Cll), (C14) are positive and act to increase the magnifi- 
cation corrections. 



APPENDIX D: B-MODES 



The new terms induced by lensing bias produce B-modes in addition to the E-modes. This is somewhat akin to the 
well-known effect of lensing of the cosmic microwave background, when large scale structure distorts the E-modes. 
Cooray and Hu [24] examined the B-modes induced by higher order corrections to the Born approximation. Lensing 
bias (and reduced shear) lead to an additional source of B-modes. The estimator for the B-mode is: 



B{i) = sm{2(j)i)^f%£) - cos(2(/),)7f ''(£) 



(Dl) 



with associated power spectrum 



(27r)2 
(2^' 



sin(2,/),)7f ^(/) - cos(20,)7f ''(/)1 \sm{2ci,,,)-if'%i) - cos(20,O72''''(^^)l )• (D2) 



In the absence of lensing corrections, ^f^'^ijt) = cos{2(j)£)K{£) and 72*^*^(1) = sm{2(j)g)K{£), so the power spectrum 
vanishes identically. Lensing effects lead to a new term when the shears are estimated: 



or in Fourier space, 



7°'^W-7a(^) + (l+9) 



72 «/ 



(2^)^ 



(D3) 



(D4) 



The second term here is the only one that survives when computing the B-mode spectrum. Inserting these into 
equation (D2) leads to 



C^(£) = {1 + qf 





r (fi" 


r dH'" 


1 (2^)2 j 


(2^)2 j 


(27r)2< 



sin(2(/)f) cosi2(l)e")Kii")K{e- I") - cos(20f) suv{2(t>i>")K{l")K{l - I") 



sui{2(t>i>) cos{24)i>n,)n{i")K{i' - 1"') - cos(20r ) sin(2<?if"0'«(^")'«(^ " ) 



d^l" 



■ s\TL{2(t)i - 2(t)iu)s\TL{2(t)i, - 2(t)i>.,){K{i")K{I - - i")). (D5) 



(27r)2 J (27r)2 J (27r)2 

Apart from the I = mode, there are two ways to contract the (assumed) Gaussian convergence fields in equation (D5) 



{K{r)K{e - r)K{£'")K{i' - r)) = (2^fb'^{i + f )c'^(r)c''(K - r|) ^''{f + a") -f (^^(r - 1 - c") 



(D6) 
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so, choosing 0^ = leads to 



■ sin(20^OC"'(^')C"'(K"- ^'D sin(2#) + sin(20,-,_,-) 



(D7) 



Apart from geometric factors, this is of order £'^C^{£) ^ 10""* smaller than the E-mode spectrum, in qualitative 
agreement with the terms analyzed in [24]. Note that here the trispectrum terms may contribute an even larger 
correction on small scales. We leave this calculation for future work. 
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